Population of transgenic plants individually comprising distinct codogenic gene segments, the population having at least 50% of the codogenic gene segments from a donor organism

ABSTRACT

The present invention relates to populations of trangenic plants encompassing a substantial part of all codogenic gene segments of a donor organism, and to biological material derived therefrom, plasmid collections and populations of transformed host organisms with which plants can be transformed in a suitable manner. There are also described methods for generating the plants and the material, and the use of the plants and of the material for functional studies. The codogenic gene segments are integrated into the genome of the plants. For example, there are described a population of plants of the species  Arabidopsis thaliana  into whose genome the codogenic gene segments from  Saccharomyces cerevisiae  are integrated, and their morphological analysis under normal conditions and stress conditions.

RELATED APPLICATIONS

This application is a national stage application (under 35 U.S.C. 371) of PCT/EP2003/00922 filed Mar. 18, 2003, which claims benefit of German application 102 12 158.3 filed Mar. 19, 2002.

FIELD OF THE INVENTION

The present invention relates to populations of trangenic plants encompassing a substantial part of all codogenic gene segments of a donor organism, and to biological material derived therefrom, plasmid collections and populations of transformed host organisms with which plants can be transformed in a suitable manner. There are also described methods for generating the plants and the material, and the use of the plants and of the material for functional studies.

DESCRIPTION OF THE BACKGROUND

The rising world population and diminishing areas for cultivation mean that there is a need for sustainable production. Genomic information may be exploited in this context for the optimization of production processes, in partricular in chemistry, in the production of foodstuffs and in agriculture. Today, enormous amounts of genomic information are available. However, this predominantly takes the form of sequence information with indirect functional assignment, if any.

It has already been attempted to express individual genes from various organisms in plants for a variety of purposes. In these experiments, it was attempted to study the function of a particular gene in the plant and its effect on the physiology of the plant.

Approaches regarding the expression of a plurality of genes of a specific donor organism in a model plant have as yet only been limited to transient expression systems. Such an approach is described, for example, in WO 99/36516 and WO 01/07600, where a cDNA library derived from the donor organism is inserted into a suitable vector of a plant virus, whereby rapid and strong expression of the cDNA can be achieved after infecting the host plant with said vector. However, only those cDNAs which exist in the original library can be expressed in the plants, but significant amounts of the genes of a genome are expressed only very weakly or under highly specific conditions, if at all, and therefore not covered by such an approach. Another disadvantage of this procedure is that the infection causes effects. These effects may influence the results of physiological studies on the infected plants. Another weakness, which is likewise inherent in transient expression, is that the availability of the transfected material is only transient. Thus, the periods available for analytical studies are only relatively short, while developmental and prolonged effects are not identified. Moreover, when viral vectors are used, the size of the sequences which can be incorporated is limited owing to the fact that the sequences must be packaged in the viral coat.

SUMMARY OF THE INVENTION

It is an object of the present invention to provide plants which are suitable for systematically carrying out functional studies into the function of foreign genes over the entire developmental cycle of the plant and, if desired, even over more than one generation.

This object is achieved by the subject of the present invention, which is a population of transgenic plants comprising at least 50% of all codogenic gene segments of a donor organism, the codogenic gene segments being integrated in the gename of the plants.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1A-1D1 lists 5393 codogenic gene segments from Saccharomyces cerevisiae.

FIG. 2A-2N lists codogenic gene segments from E. coli.

DESCRIPTION OF THE INVENTION

The term “codogenic gene segment” refers to a nucleic acid. Nucleic acids are formed from monomers (nucleotides) and contain sugars, phosphate and either a purine or pyrimidine or their derivatives. They include DNA and RNA sequences which can be single- or double-stranded and can optionally contain synthetic, unnatural or modified nucleotide bases which can be incorporated into DNA or RNA.

In particular, the term “codogenic gene segment” refers to the coding sequence, i.e. the part of a gene which encodes a protein, polypeptide or part thereof.

The term “gene” refers to a DNA sequence which encompasses both the codogenic gene segment, that is to say in particular the coding sequence, and regulatory elements.

The structural gene which encompasses the codogenic gene segments can have a continuous coding sequence (open reading frame, abbreviated to ORF) or may contain one or more introns which are linked to the exons via suitable splice junctions.

The codogenic gene segments according to the invention are normally found in the cells of the donor organism. These sequences are therefore autologous with regard to the donor organism. In contrast, these codogenic gene segments are normally not found in the plants on which the populations according to the invention are based (recipient plant), unless donor organism and recipient plant are identical with regard to the codogenic gene segment. With regard to the recipient plant, these sequences are therefore heterologous or sequences which, while they can be found in the recipient plant, are derived from another individual. Accordingly, parts of the genome of the recipient plant can be homologous to or identical with a specific integrated codogenic gene segment.

What is critical for the present invention is the integration of the codogenic gene segments in the genome of the plants, where a specific codogenic gene segment can be integrated in the form of a continuous coding sequence (ORF) or may contain one or more introns. If this is the case, such sequences are, as a rule, spliced while being expressed by the plant, it being possible, but not imperative, that the splice pattern corresponds to that of the donor organism.

In principle, the codogenic gene segments can be integrated into the extranuclear genome, for example the plastid genome, of a plant. However, preferred in accordance with the invention is integration into the nuclear genome.

It is preferred in accordance with the invention stably to integrate sequences encompassing the heterologous codogenic gene segments into the genome of the plants. This entails one or more of the following aspects:

-   -   during the life cycle of a plant, the number of copies of a         specific codogenic segment per cell is essentially constant;     -   the number of copies of a specific codogenic gene segment per         cell can be determined;     -   being a trait of the plant, the codogenic gene segment is         heritable and, in the case of nuclear integration, subject to         Mendelian inheritance.

As a rule, the number of integrated copies of a specific codogenic gene segment per cell amounts to less than 20 and in most cases to less than 10. Preferred in accordance with the invention are plants with cells comprising approximately 1 to 5 copies and in particular 1 copy of a specific codogenic gene segment. The number of copies per cell can be determined in a manner known per se by “Southern blot” analysis (extraction of the genomic DNA, restriction digest, electrophoretic separation, transfer to membranes, hybridization with labeled DNA-specific probe or quantitative PCR).

In accordance with the invention, the life cycle of a plant advantageously comprises all of the developmental stages from germination to seed maturation. For example, the life cycle of Arabidopsis thaliana encompasses epigeal germination, the development of the seedling into a rosette, the development of the main shoot from the rosette, branching of the main shoot, the development of flowers on the shoots, the flowering process, and seed maturation. Under favorable conditions, Arabidopsis thaliana can be propagated seed to seed within approximately 6 weeks.

In accordance with a particular embodiment of the present invention, the heterologous codogenic gene segments in the genome of the transgenic plants are flanked unilaterally or, preferably, bilaterally, by T-DNA sequences, in particular by agrobacterial Ti plasmid sequences. This is likewise an aspect of the stable integration, according to the invention, of the codogenic gene segments into the genome of the plants.

Populations according to the invention of transgenic plants comprise at least 50% of all of the codogenic gene segments of a particular donor organism. Populations of transgenic plants with at least 70% and in particular with at least 90% of all of the codogenic gene segments of a specific donor organism are preferred.

In addition, the codogenic gene segments of a donor organism are generally characterized by the function of their expression products. These include, in particular, functions in the aspects metabolism, energy, transcription, protein synthesis, protein processing, cellular transport and transport mechanisms, cellular communication and signal transduction, cell rescue, cell defence and cell virulence, regulation of the cellular environment and interaction of the cell with its environment, cell fat, transposable elements, viral proteins and plasmid proteins, cellular organization control, subcellular localization, regulation of protein activity, proteins with binding function or cofactor requirement, and transport facilitation. Codogenic gene segments with the same function are combined to what are known as functional gene families.

Metabolic functions relate in particular to the amino acid metabolism, the nitrogen and sulfur metabolism, the nucleotide metabolism, the phosphate metabolism, the carbon and carbohydrate metabolism, the fat, fatty acid and isoprenoid metabolism, the metabolism of vitamins, cofactors and prosthetic groups, and the secondary metabolism.

Energetic functions relate in particular to glycolysis and glyconeogenesis, the pentose phosphate metabolism, the citric acid cycle, the electron transport and the membrane-associated energy storage, respiration, photosynthesis, fermentation, the metabolism of energy reserves, the glyoxylate cycle and fatty acid oxidation.

Functions regarding transcription affect mainly the transcription of rRNA-, tRNA and mRNA, RNA transport and transcript processing.

Functions regarding protein synthesis affect mainly ribosomal biogenesis, translation, translation control and aminoacyl-tRNA synthetases.

Functions regarding protein processing affect mainly folding and stabilization, targeting, sorting and translocation, and the modification of proteins, the assembly of protein complexes and the proteolytic degradation of proteins.

Functions regarding cellular transport and transport mechanisms affect mainly the nuclear, mitochondrial, vesicular, peroxysomal, vacuolar and extracellular transport, exocytossis and secretion, endocytosis, the cellular import and the cytoskeleton-dependent transport.

Functions regarding cellular communication and signal transduction affect mainly intracellular signaling and signal reception, and transmembrane signal transduction.

Functions regarding cell rescue, cell defence and cell virulence affect mainly stress response, detoxification and the degradation of foreign (exogenous) substances.

Functions regarding the regulation of the cellular environment and interaction of the cell with its environment affect mainly ionic homeostasis and cellular perception and response.

Functions regarding cell fate affect mainly the cell cycle and cell growth, cell morphogenesis, cell differentiation, cell death and cell senescence.

Functions regarding the cellular organization control affect mainly the cell wall, the plasma membrane, the cytoplasm, the cytoskeleton, the endoplasmic reticulum, the golgi apparatus, the plastids, the intracellular transport vesicles, the nucleus, the mitochondria, the peroxysomes, the endosomes and the vacuoles or lysosomes.

Functions regarding the subcelluloar localization affect in particular the cell wall, the plasma membrane, the cytoplasm, the cytoskeleton, the centrosomes, the endoplasmic reticulum, the golgi apparatus, intracellular transport vesicles, the nucleus, mitochondria, peroxysomes, endosomes, vacuoles or lysosomes, extracellular or secreted proteins, and prokaryotic cell membranes.

Functions regarding the regulation of the protein activity affect mainly the targets to be regulated.

Functions regarding proteins with binding function or cofactor requirement affect mainly protein binding and lipid binding.

Functions regarding transport facilitation affect mainly channel/pore type transporters, ion transporters, carbon and carbohydrate transporters, amino acid transporters, peptide transporters, lipid transporters, nucleotide transporters, allantoin and allantoate transporters, transporters of active substances, and transport mechanisms.

Particularly important in accordance with the invention are functions regarding metabolism and energy, in particular enzymes of the primary and secondary metabolism, for example P450 enzymes, regarding transcription, in particular transcription factors, and regarding cellular transport and transport mechanisms, in particular channels and transporters.

Thus, populations of transgenic plants which are preferred are in particular those which have at least one subpopulation comprising at least 55% and in particular at least 80% of all of the codogenic gene segments of a donor organism which can be assigned to a functional gene family.

As a rule, the codogenic gene segments of a specific donor organism are publicly available. Those which must be mentioned in particular are public gene databases such as the EMBL database (Stoesser G. et al., Nucleic Acids Res 2001, Vol. 29, 17-21), the GenBank database (Benson D. A. et al., Nucleic Acids Res 2000, Vol. 28,15-18), or the PIR database (Barker W. C. et al., Nucleic Acids Res. 1999, Vol. 27, 39-43).

Moreover, organism-specific gene databases can be used, for example the SGD database (Cherry J. M. et al., Nucleic Acids Res. 1998, Vol. 26, 73-80) or the MIPS database (Mewes H. W. et al., Nucleic Acids Res. 1999, Vol. 27, 44-48) for yeast, the GenProtEC database Riley M., Nucleic Acid Res. 1998, Vol. 26(1), 54) for E. coli, the TAIR database (Huala, E. et al., Nucleic Acids Res. 2001 Vol. 29(1), 102-5) or the MIPS database for Arabidopsis.

Using this database information, nucleic acids encompassing the codogenic gene segments can be obtained in a manner known per se from suitable sources and provided for the subsequent integration in the plant genomes.

The donor organisms include both prokaryotic and eukaryotic organisms such as viruses, bacteria, yeasts, fungi, algae, plants and animals.

If plants are chosen as donor organism, the plant can, in principle, have any phylogenetic relationship with the recipient plant. Thus, donor and recipient plants may belong to the same family, genus, species, variety or line, with increasing homology between the codogenic gene segments to be integrated and the corresponding parts of the genome of the recipient plant. Donor organisms which are preferred in accordance with the invention are microorganisms, in particular those whose genomes have been sequenced, for example Acetobacter (subgen. Acetobacter) aceti; Acidithiobacillus ferrooxidans; Acinetobacter sp.; Actinobacillus sp; Aeromonas salmonicida; Agrobacterium tumefaciens; Aquifex aeolicus; Arcanobacterium Pyogenes; Aster yellows phytoplasma; Bacillus sp.; Bifidobacterium sp.; Borrelia burgdorferi; Brevibacterium linens; Brucella melitensis; Buchnera sp.; Butyrivibrio fibrisolvens; Campylobacter jejuni; Caulobacter crescentus; Chlamydia sp.; Chlamydophila sp.; Chlorobium limicola; Citrobacter rodentium; Clostridium sp.; Comamonas testosteroni; Corynebacterium sp.; Coxiella burnetii; Deinococcus radiodurans; Dichelobacter nodosus; Edwardsiella ictaluri; Enterobacter sp.; Erysipelothrix rhusiopathiae; Escherichia coli; Flavobacterium sp.; Francisella tularensis; Frankia sp. Cpl1; Fusobacterium nucleatum; Geobacillus stearothermophilus; Gluconobacter oxydans; Haemophilus sp.; Helicobacter pylori; Klebsiella pneumoniae; Lactobacillus sp.; Lactococcus lactis; Listeria sp.; Mannheimia haemolytica; Mesorhizobium loti; Methylophaga thalassica; Microcystis aeruginosa; Microscilla sp. PRE1; Moraxella sp. TA144; Mycobacterium sp.; Mycoplasma sp.; Neisseria sp.; Nitrosomonas sp.; Nostoc sp. PCC 7120; Novosphingobium aromaticivorans; Oenococcus oeni; Pantoea citrea; Pasteurella multocida; Pediococcus pentosaceus; Phormidium foveolarum; Phytoplasma sp.; Plectonema boryanum; Prevotella ruminicola; Propionibacterium sp.; Proteus vulgaris; Pseudomonas sp.; Ralstonia sp.; Rhizobium sp.; Rhodococcus equi; Rhodothermus marinus; Rickettsia sp.; Riemerella anatipestifer; Ruminococcus flavefaciens; Salmonella sp.; Selenomonas ruminantium; Serratia entomophila; Shigella sp.; Sinorhizobium meliloti; Staphylococcus sp.; Streptococcus sp.; Streptomyces sp.; Synechococcus sp.; Synechocystis sp. PCC 6803; Thermotoga maritima; Treponema sp.; Ureaplasma urealyticum; Vibrio cholerae; Vibrio parahaemolyticus; Xylella fastidiosa; Yersinia sp.; Zymomonas mobilis.

In accordance with a particular embodiment of the present invention, the donor organism is a yeast, preferably of the genus Saccharomyces, in particular Saccharomyces cerevisiae.

Accordingly, a very particular embodiment of the present invention relates to populations of transgenic plants encompassing at least approximately 3000, preferably at least approximately 4500 and advantageously at least approximately 5500 codogenic gene segments from Saccharomyces cerevisiae, assuming that the approximate total of codogenic gene segments amounts to 6300 (Mewes et al., Nature 387 (Suppl) July 65, 1997). In particular, the codogenic gene segments are selected from among the codogenic gene segments shown in FIG. 1.

In accordance with a further particular embodiment, the donor organism is a bacterium of the genus Escherichia, preferably E. coli.

Accordingly, a further very particular embodiment of the present invention relates to populations of transgenic plants encompassing at least approximately 2000, preferably at least approximately 3000 and advantageously at least approximately 3500 codogenic gene segments from E.coli, assuming that the approximate total of codogenic gene segments amounts to 4300. In particular, the codogenic gene segments are selected from among the codogenic gene segments shown in FIG. 2.

Accordingly, populations according to the invention of transgenic plants are composed of several individual plants. These individual plants can be distinguished at least at the molecular level. Thus, for each codogenic gene segment from the totality of the codogenic gene segments of a particular donor organism which are encompassed by the population according to the invention of transgenic plants, there is at least one individual of the population with this codogenic gene segment. Accordingly, a population according to the invention of transgenic plants, which encompasses a particular number of a variety of codogenic gene segments of a donor organism, contains at least the same number of individual plants, of which each individual plant is characterized in that it contains a particular gene segment of said codogenic gene segment, while not containing the remaining said codogenic gene segments.

In addition, populations according to the invention can encompass at least one further individual plant in whose genome a specific codogenic gene segment of a donor organism is integrated in combination with a further codogenic gene segment or a plurality of further codogenic gene segments of the donor organism.

As a rule, the codogenic gene segments are linked operably with regulatory sequences in the genome of the plants. It is preferred within a population of transgenic plants to use analogous regulatory sequences for all of the heterologous codogenic gene segments.

As a rule, operable linkage of a specific codogenic gene segment with one or more regulatory sequences comprises a chemical fusion of two or more DNA fragments in suitable orientation, for example in sense or antisense orientation, so that a suitable reading frame is maintained or created by the fused sequences and expedient regulation of the expression of the DNA sequences in the plant cell is ensured.

The term “expression” refers to the transcription and/or translation of a codogenic gene segment. As a rule, the resulting product is a protein. However, expression also includes the transcription of a DNA which is inserted in antisense orientation relative to regulatory elements, thus creating an antisense mRNA. The products also include ribozymes. Expression can be systemic or local, for example limited to specific cell types, tissues or organs.

In principle, the regulatory sequences can take the form of sequences of the recipient plants according to the invention, sequences of the donor organism, sequences of a further organism or synthetic sequences, as long as the function of these sequences in the plants according to the invention is ensured.

As a rule, regulatory sequences are arranged upstream (5′), within and/or downstream (3′) relative to a specific codogenic gene segment. In particular, they govern the transcription and/or translation and the transcript stability of the codogenic gene segment, if appropriate in conjunction with further homologous functional systems, such as the cellular protein biosynthesis apparatus.

Regulatory sequences include, above all, sequences arranged upstream (5′), which affect in particular the regulation of transcription initiation, such as promoters, and sequences arranged downstream (3′), which affect above all the regulation of transcription termination, such as polyadenylation signals.

Promoters predominantly govern the expression of the codogenic gene segment by acting as the attachment site for RNA polymerases and/or further factors required for an appropriate transcription initiation.

In principle, all promoters which are capable of stimulating the transcription of genes in plants can be employed. Suitable promoters which are functional in plants are generally known. They may be constitutive or inducible promoters. Suitable promoters may also be developmental- and/or tissue-specific, such as leaf-, root-, seed- and fruit-specific promoters.

Promoters which have proven particularly useful are promoters from phytopathogenic viruses and bacteria. Examples are the CaMV 35S promoter, the FMV 34S promoter, or a promoter from Cassava Vein Mosaic Virus (CsVMV), and, for example, a variety of promoters from Agrobacterium tumefaciens, such as the octopine synthase promoter (ocs), the nopaline synthase promotor (nos), or the mannopine synthase promoter. Moreover, a large number of plant promoters are available for the expression of genes in specific tissues, such as roots, flowers or stomata and above all seeds, such as, for example, the arc5 promoter from bean, the LeB4 promoter and the USP promoter from Vicia faba and the DcG3 promoter from carrot. A variety of promoters are also available above all for expression in seeds. Further seed-specific promoters are the sucrose binding protein promoter (WO 00/26388), the phaseolin promoter and the napin promoter. To express heterologous sequences strongly in as many tissues as possible, in particular also in leaves, it is preferred to use plant promoters of actin and ubiquitin genes, such as, for example, the rice Actin1 promoter besides various of the abovementioned viral and bacterial promoters. A further example of constitutive plant promoters are the sugar beet V-ATPase promoters (WO 01/14572). Examples of synthetic constitutive promoters which must be mentioned are the superpromoter (WO 95/14098) and promoters derived from G-boxes (WO 94/12015). Furthermore, chemically inducible promoters may also be employed under certain circumstances, compare EP-A 388186, EP-A 335528, WO 97/06268. Others which are available for the expression of genes in plants are leaf-specific promoters as described in DE-A 19644478 or light-regulated promoters such as, for example the petE promoter from pea.

Among the polyadenylation signals, the poly-A addition sequence form the Agrobacterium tumefaciens ocs or nos gene may be mentioned, in particular.

Further regulatory sequences which may be expedient also include sequences which govern the transport and/or the localization of the expression products (targeting). Sequences which must be mentioned in particular in this context are the signal-peptide- or transit-peptide-encoding sequences which are known per se. For example, the expression product can successfully be addressed to the plastids of a plant cell with the aid of plastid-transit-peptide-encoding sequences.

Plants which are preferred as recipient plants are in particular those which are capable of being transformed in an expedient manner. They include monocotyledonous and dicotelydonous plants. Plants which must be mentioned in particular are agriculturally useful plants such as cereals and grasses, for example Triticum spp., Zea mais, Hordeum vulgare, Hafer, Secale cereale, Oryza sativa, Pennisetum glaucum, Sorghum bicolor, Triticale, Agrostis spp., Cenchrus ciliaris, Dactylis glomerata, Festuca arundinacea, Lolium spp., Medicago spp. and Saccharum spp., pulses and oil crops, for example Brassica juncea, Brassica napus, Glycine max, Arachis hypogaea, Gossypium hirsutum, Cicer arietinum, Helianthus annuus, Lens culinaris, Linum usitatissimum, Sinapis alba, Trifolium repens and

Vicia narbonensis, vegetables and fruits, for example bananas, grapes, Lycopersicon esculentum, asparagus, cabbage, water melons, kiwi fruit, Solanum tuberosum, Beta vulgaris, cassava and chicory, trees, for example Coffea species, Citrus spp., Eucalyptus spp., Picea spp., Pinus spp. and Populus spp., medicinal plants and trees, and flowers.

In accordance with a particular embodiment, the present invention relates to transgenic plants of the genus Arabidopsis, for example Arabidopsis thaliana and the genus Oryza, for example Oryza sativa.

Especially preferred populations according to the invention of transgenic plants are based on a population of recipient plants which is essentially homogeneous. Accordingly, the essential distinguishing feature of individuals of this population is the integration of one or more codogenic gene segments of a donor organism. However, an in-planta transformation which has been carried out with one or more codogenic gene segments results, as a rule, in several transgenic plants which then, together with the respective direct progeny, form lines of one or more codogenic gene segments (multiline). A certain variability, which can essentially be attributed to differences in the transformation process, can result between two lines, i.e. individual transgenic plants and their progeny, in whose genome the same codogenic gene segment(s) of a donor organism are integrated. For example, differences between the lines with regard to integration sites and/or copy numbers may lead to quantitatively different expression of the codogenic gene segment(s).

In accordance with a particular embodiment, populations according to the invention encompass at least 5, at least 10 or at least 20 lines per codogenic gene segment.

In particular, it is preferred uniformly to choose regulatory sequences and further sequences due to the transformation process for different codogenic gene segments for a particular population of transgenic plants, so that differences within the population can essentially be contributed to the different codogenic gene segments.

Accordingly, transgenic plants according to the invention comprise at least one transgenic cell which encompasses one or more codogenic gene segments of a donor organism and is advantageously capable of expressing the same. Preferably, essentially all the cells of a transgenic plant according to the invention are transgenic.

The term “transgenic plants” which is used in accordance with the invention also refers to the progeny of a transgenic plant, for example the T₁, T₂, T₃ and subsequent plant generations, or the BC₁, BC₂, BC₃ and subsequent plant generations. Thus, the transgenic plants according to the invention can be raised and selfed or crossed with other individuals in order to obtain further transgenic plants according to the invention. Transgenic plants can also be obtained by means of vegetative propagation of transgenic plant cells.

The present invention also relates to transgenic plant material which can be derived from a population according to the invention of transgenic plants. Such material includes plant cells and specific tissues, organs and parts of plants in all their manifestations, such as seeds, leaves, anthers, fibers, roots, root hairs, stems, embryos, calli, cotyledons, petioles, crops, plant tissue, reproductive tissue and cell cultures which is derived from the actual transgenic plant and/or can be used for bringing about the transgenic plant. Also, such material encompasses at least 50% of all codogenic gene segments of the donor organism in question.

The present invention furthermore relates to a method for generating a population of transgenic plants in which

-   a) at least one codogenic gene segment of a donor organism is     provided; -   b) this codogenic gene segment, if appropriate in combination with a     further codogenic gene segment or several further codogenic gene     segments of the donor organism, is integrated into the genome of at     least one plant; and -   c) steps a) and b) are carried out for essentially all of the     codogenic gene segments of the donor organism.

A codogenic gene segment can expediently be provided in a manner known per se, for example, by customary cloning. In particular, genomic DNA or mRNA can be amplified by means of cDNA of the donor organism. The first-mentioned procedure is the method of choice in the case of donor organisms whose genome has no introns. The last-mentioned procedure is expedient when the genome of the donor organism has introns and these introns are not to be integrated together with the codogenic gene segment in the genome of the plant. This gives a nucleic acid sequence containing the desired codogenic gene segment.

For further cloning, the sequence thus obtained, which contains the codogenic gene segment, is, as a rule, inserted into a suitable cloning vector. Suitable vectors are above all those which are capable of replication in suitable intermediate hosts such as bacterial host cells, for example Bacillus, Streptomyces, Salmonella and, above all Escherichia coli. The skilled worker is familiar with a multiplicity of suitable vectors and manipulations such as restriction, insertion, ligation, deletion and the like are also part of the expert knowledge, so that the desired sequence which contains the codogenic gene segment, if appropriate together with regulatory elements and/or further elements required for transformation and expression, is, as a rule, successfully provided.

In accordance with a preferred embodiment, a codogenic gene segment is provided by cloning it in a ligation-independent fashion. The advantage of this procedure is that the codogenic gene segments are neither cut by the earlier use of restriction endonucleases nor cloned in an undirected fashion via blunt ends or sticky ends, and the vector itself has no religation potential, a high cloning efficacy thus being achieved.

Particular procedures which can be followed for this purpose are

-   a1) to amplify the codogenic gene segment with random incorporation     of an amount of thio-dNTPs; -   a2) to allow a 3′-5′-exonuclease to act on the amplificate so that     fragments with single-stranded ends are formed; and -   a3) to incubate under annealing conditions the fragments resulting     from step a2) with a vector whose single-stranded ends are at least     in part complementary to the single-stranded ends of the fragments.

The amplification itself can be carried out in a manner known per se.

Preferably, a procedure is followed which matches the protocol for Pfu DNA polymerase or a Pfu/Taq DNA polymerase mixture.

The primers are chosen to match the sequence to be amplified.

The primers should expediently be chosen in such a way that the amplificate encompasses the entire codogenic sequence from the start codon to the stop codon. It is preferred to employ chimeras whose 5′ ends are formed in each case by a universal adaptor sequence and whose 3′ ends are formed in each case by a specific sequence of the codogenic gene segment. If the universal adaptor sequences for forward and reverse primers are different, directed cloning into the expression vector is possible.

Following the amplification, the amplificates are expediently analyzed. For example, separation by gel electrophoresis can be followed by qualitative and quantitative analysis.

If dNTPs are employed as a mixture with thio-dNTPs for the amplification, a ratio of 125:1 has proved advantageous, i.e., in theory, a thio-dNTP is incorporated instead of a dNTP at every 125^(th) position of the amplificate.

The incorporation of thio-dNTPs makes it possible to modify the amplificates with the aid of exonucleases. In particular, the amplified amplificates which, as a rule, have double-stranded ends, can be incubated with the enzyme exonuclease III. The enzyme has 3′-5′-exonuclease activity, so that the double-stranded DNA ends of the amplificates are degraded starting at the 3′ end as a function of amplificate quantity, temperature, time and enzyme quantity. What remains is the counterstrand with the resistant 5′ end. Since this enzyme is not capable of breaking down thio-dNTPs, the degradation stops at the first thionucleotide which has been incorporated. Owing to the incorporation of the thio-dNTPs, the amplificates are thus only subjected to limited degradation, independently of their size and DNA concentration.

Thereafter, the modified amplificates can be purified following a standard protocol (for example Qiagen). An aliquot of the purified amplificates is then available for the subsequent cloning, which, according to what has been said above, can be performed in particular in a ligation-independent fashion.

Suitable cloning vectors are generally known in expert circles. They include, in particular, vectors which are capable of replication in bacterial systems, i.e. above all vectors which ensure efficient cloning in E. coli and which make possible the stable transformation of plants. Those which must be mentioned in particular are various binary and co-integrated vector systems which are suitable for T-DNA-mediated transformation. As a rule, such vector systems are characterized in that they contain at least the vir genes required for agrobacterium-mediated transformation and the sequences which delimit the T-DNA (T-DNA border). These vector systems preferably also encompass further cis-regulatory regions such as promoters and terminators and/or selection markers by means of which suitably transformed organisms can be identified. While vir genes and T-DNA sequences are arranged on the same vector in co-integrated vector systems, binary systems are based on at least two vectors of which one harbors vir genes, but no T-DNA, while a second vector harbors T-DNA, but no vir gene. The last-mentioned vectors are therefore relatively small, easy to manipulate and capable of replication both in E-coli and in agrobacterium. These binary vectors include vectors from the series pBIB-HYG, pPZP, pBecks, pGreen. Vectors which are-preferably used in accordance with the invention are Bin19, pBl101, pBinAR, pGPTV and pCAMBIA. Binary vectors and their use are reviewed by Hellens et al, Trends in Plant Science (2000) 5, 446-451.

To prepare the vectors, the vectors can first be linearized with restriction endonuclease(s) and then enzymatically modified in a suitable manner. Another enzyme which is suitable for this purpose and which has 3′-5′-exonuclease activity is T4-DNA polymerase, which degrades the double-stranded DNA ends of the linearized vector into single strands as a function of vector quantity, temperature, time and enzyme quantity. The vector is subsequently purified, and in each case one aliquot is employed for the ligation-independent cloning.

For ligation-independent cloning, the enzymatically modified and, if appropriate, purified amplificates are cloned with similarly prepared vector fragments without the use of ligase. Instead of this enzyme, which is conventionally used, the annealing potential of the single-stranded ends of the modified amplificates and of the single-stranded ends of the vector, which have compatible adapter sequences, are exploited. This gives vectors which encompass the desired codogenic gene segment(s) (plasmid constructs).

The present invention therefore also relates to collections of plasmid constructs in which at least 50% of all of the codogenic gene segments of a donor organism are integrated. Here, specific plasmid constructs can have one or else more than one codogenic gene segments. The codogenic gene segments in these plasmid constructs are preferably operably linked with regulatory sequences. The regulatory sequences include, in particular, plant sequences like the above-described promoters and terminators. Advantageously, the plasmid constructs can stably be propagated in microorganisms, in particular Escherichia coli and Agrobacterium tumefaciens, under selective conditions, and make possible the transfer of heterologous DNA into plants.

In accordance with a particular embodiment, the plasmid constructs according to the invention are based on binary vectors (review of binary vectors in Hellens et al., 2000). As a rule, they contain prokaryotic regulatory sequences such as replication origin and selection markers for the propagation in microorganisms such as Escherichia coli and Agrobacterium tumefaciens, and agrobacterial T-DNA sequences to transfer DNA into plant genomes. At least the right border sequence, which encompasses approximately 25 base pairs, of the entire agrobacterial T-DNA sequence is required. The vector constructs according to the invention usually contain T-DNA sequences from both the right and the left border region, which contain expedient recognition sites for site-specifically acting enzymes which, in turn, are encoded by some of the vir genes.

The present invention moreover also relates to collections of host organisms which contain the collections according to the invention of plasmid constructs. In this sense, the host organisms are transformed with the plasmid constructs according to the invention. Suitable host organisms are known to the skilled worker. They include above all bacterial hosts of which some have already been mentioned above in context with donor microorganisms, for example bacteria from the genera Bacillus Streptomyces, Salmonella, and the like. Host organisms which are preferred in accordance with the invention are those from the genus Escherichia, in particular Escherichia coli, and Agrobakterium, in particular Agrobacterium tumefaciens.

Moreover, what has been said about the plant population also applies analogously to the collections according to the invention of plasmid constructs.

The resulting plasmid constructs can subsequently be transferred into a suitable intermediate host, for example a bacterium, for verification purposes. The transformation into E. coli, which can be carried out in a manner known per se, for example by means of heat shock or electroporation, has proved expedient in this context.

The transformed E. coli colonies can thus be analyzed for cloning efficacy. This can be carried out with the aid of a PCR. In such a procedure, both the identity and the integrity of the plasmid construct can be verified using a defined number of colonies by subjecting an aliquot of the colonies to said PCR. To do so, universal primers which are derived from vector sequences are generally employed, the forward primer being arranged upstream of the start ATG and the reverse primer downstream of the stop codon of the codogenic gene segment. The amplificates are separated by electrophoresis and assessed for quantity and quality. If a fragment with a suitable size is detected, the assessment is positive.

The plasmid constructs which have optionally been verified are subsequently used for transforming the plants. To this end, it may first be necessary to obtain the constructs from the intermediate hosts. For example, the constructs can be obtained from bacterial hosts in the form of plasmids using a procedure analogous to conventional plasmid isolation.

A large number of methods for transforming plants are known. Since stable integration of heterologous DNA into the genome of plants is advantageous according to the invention, T-DNA-mediated transformation in particular has proved to be expedient.

To do so, it is first necessary to transform suitable vehicles, in particular agrobacteria, with the codogenic gene segment or the corresponding plasmid construct. This can be done in a manner known per se. For example, the plasmid construct which has been generated according to what has been said above can be transformed into competent agrobacteria by means of electroporation or heat shock.

In principle, one must distinguish between the formation of co-integrated vectors and the transformation with binary vectors. In the first alternative, the vector constructs encompassing the codogenic gene segment have no T-DNA sequences, but the co-integrated vectors are formed in the agrobacteria by homologous recombination of the vector construct with T-DNA. The T-DNA is present in the agrobacteria in the form of Ti or Ri plasmids in which the oncogenes have expediently been replaced by exogenous DNA. If binary vectors are used, they can be transferred to agrobacteria by bacterial conjugation or directly. These agrobacteria expediently already comprise the vector which harbors the vir genes (frequently referred to as helper-Ti(Ri) plasmid).

One or more markers with the aid of which the transformed agrobacteria and transformed plant cells can be selected may also advantageously be used together with the plasmid construct and the T-DNA. A multiplicity of markers have been developed for this purpose. They include for example those which confer resistance to chloramphenicol, kanamycin, the aminoglycoside G418, hygromycin and the like.

If desired, the plasmid constructs can be verified again with regard to identity and/or integrity before they are transformed into agrobacteria. This can be done for example in analogy to the above PCR for verifying the cloning efficacy.

As a rule, it is desired that the plasmid constructs are flanked by T-DNA unilaterally or bilaterally of the codogenic gene segment. This is particularly useful when the bacteria used for the transformation belong to the species Agrobacterium tumefaciens or Agrobacterium rhizogenes.

Preferred in accordance with the invention is the transformation with the aid of Agrobacterium tumefaciens.

The transformed agrobacteria can be cultured in a manner known per se and are thus available for expedient plant transformation.

The plants or plant parts to be transformed are grown or provided in the customary manner. The transformed agrobacteria are subsequently allowed to act on the plants or plant parts until a sufficiently high transformation rate has been achieved.

The agrobacteria can act on the plants or plant parts in different ways.

For example, a culture of morphogenic plant cells or tissue can be used. Subsequently to the T-DNA transfer, the bacteria are eliminated, as a rule by using antibiotics, and the regeneration of plant tissue is induced. This is done in particular using plant hormones so that, after callus has formed initially, the development of shoots is promoted.

It is preferred in accordance with the invention to carry out the transformation in planta. To this end, it is possible, for example, to allow the agrobacteria to act on seeds of plants or to inoculate plant meristem with agrobacteria.

In particular it has proved advantageous according to the invention to allow a suspension of transformed agrobacteria to act on the intact plant or at least the flower primordia. The plant is subsequently grown on until seeds of the treated plant are obtained (Clough and Bent, Plant J. (1998) 16, 735-743).

To select transformed plants, plant material obtained from the transformation process is, as a rule, subjected to selective conditions so that transformed plants can be distinguished from untransformed plants. For example, the seeds which have been obtained in the above-described manner can again be planted and, after cultivation, be subjected to suitable selection by spraying. Another possibility is to sterilize the seeds, if appropriate, and then to grow them on agar plates using a suitable selection agent such that only the transformed seeds can grow into plants.

The present invention also relates to the use of a population according to the invention of transgenic plants and/or a biological material derived therefrom for functional studies.

In accordance with an embodiment, the functional studies concern the metabolism of the transgenic plants. Thus, it is possible to study in particular biochemical parameters, such as the accumulation of specific substrates or products of enzymatic reactions or the expression of endogenous genes of the recipient plant, for example, with the aid of the RNA or protein profile of the plant.

In accordance with a further embodiment, the functional studies concern phenotypic traits of the transgenic plants. Phenotypic traits of interest include, for example, the growth, the color, the morphology or the flowering behavior of the plants and further traits which characterize the phenotype of the plant. Such studies can be carried out by assessing individual cells of the transgenic plant. As a rule, however, such studies are carried out at the multicellular level by assessing organized structures, i.e. in particular the intact plant or plant parts such as leaves, roots and the like. The studies can therefore take the form of macroscopic or microscopic studies.

In accordance with a particular embodiment, at least one trait selected from among germination behavior, number of cotyledons, cotyledon area, number of rosette leaves, rosette leaf area, number of stomata on the rosette leaves, shape of the rosette leaves, dry weight of the rosette leaves, plant growth rate, time of shoot formation, branching of the shoot, shoot length, number of lateral shoots, number of flowers, flower size, flowering time, inflorescence shape, pollen size, color and number of pollen, seed size, seed color, seed shape, relative seed weight (based on seed volume) and absolute seed weight, total number of seeds, number of seeds per pod, pod length, number of pods per plant, time of seed maturation, root length, total root weight, root branching, number and length of root hairs, onset of senescence, duration of senescence and plant pigmentation (chlorophyll content) is studied.

In accordance with a particular aspect of the present invention, the functional studies are carried out on plants which are exposed to particular environmental conditions. These include abiotic stresses such as low-temperature stress, frost stress, drought stress, salt stress, ozone stress, CO₂ stress, light stress, oxidative stress, high-temperature stress, anoxic stress, heavy metals, ionizing/UV radiation, nutrient deficiency (for example N, P, K, microelements and the like), biotic stresses such as plant pathogens, for example fungi, plasmodia, bacteria and viruses, plant pathogens such as nematodes, protozoans, slugs, snails and insects, furthermore higher parasitic plants, and biological interactions such as symbioses.

The functional determination of the donor organism's codogenic gene segments in the recipient is of particular importance. This application applies both to codogenic gene segments whose function in the donor organism is known and to codogenic gene segments whose function in the donor organism is insufficiently elucidated, if at all. In this context, the function may already be exerted by the transcribed RNA, for example in the sense of antisense suppression or cosuppression, or by a regulatory RNA. In most cases, however, the function relates to the translated protein.

The functional determination can be carried out for example based on the above-described metabolic or phenotypic modifications.

As a rule, the use according to the invention for carrying out functional studies includes a method in which

-   a) the population according to the invention of transgenic plants is     provided; -   b) the desired functional study is carried out; and -   c) the result of the study is compared with a result of a study     obtained analogously with at least one reference plant.

The reference plant is characterized in that none of the codogenic gene segments encompassed by the population according to the invention is integrated into its genome or is at least not expressed therein. In this sense, the reference plant is to be referred to as the wild type from which the transgenic plants of the population according to the invention differ by the integration and in particular the expression of at least one codogenic gene segment.

If the comparison reveals a deviation, it can be concluded that the recipient plant has been the subject of a functional modification which correlates with the expression of one or more specific codogenic gene segments of a donor organism.

EXAMPLES

The present invention is hereinbelow described in greater detail with reference to examples.

Example 1 Generation of the Population of Transgenic Plants

The generation of a population according to the invention of transgenic plants is described with reference to Arabidopsis thaliana into whose genome the codogenic gene segments from Saccharomyces cerevisiae are integrated. By way of example, specific information is given for the YKL174C gene and the YKR009C gene. As a rule, the remaining codogenic gene segments are integrated analogously or, if necessary, in suitably modified form.

1. Generation of Transformed Agrobacteria

To generate the population according to the invention of transgenic plants, the codogenic gene segments to be integrated are initially transferred to agrobacteria. The process chain used for this purpose, which encompasses eight steps, permits efficaceous amplification and directed cloning of the amplificates into constitutive expression vectors with plant selection markers and their transformation into agrobacteria of a high-throughput process.

The process chain is divided into the following eight process steps: amplification (1), modification of the amplificate (2), vector preparation (3), ligation-independent cloning (4), transformation of E.coli (5), PCR for verifying the cloning efficacy (6), plasmid preparation (7) and transformation of agrobacteria (8). Unless otherwise specified, standard methods of Sambrook et al., Molecular Cloning: A laboratory manual, Cold Spring Harbor 1989, Cold Spring Harbor Laboratory Press are used.

1.1. Amplification

The amplification is carried out in a 96-well thermoplate in accordance with the Pfu Turbo or Herculase DNA polymerase protocol (Stratagene). The composition is as follows: 1× PCR buffer [20 mM Tris-HCl (pH 8.8), 2 mM MgSO₄, 10 mM KCl, 10 mM (NH₄)SO₄, 0.1% Triton X-100, 0.1 mg/ml BSA], 0.2 mM α-Thio-dNTP and dNTP (1:125), 100 ng genomic DNA of Saccharomyces cerevisiae (strain S288C; Research Genetics, Inc., now Invitrogen), 50 pmol forward primer, 50 pmol reverse primer, 2.5 U Pfu or Herculase DNA polymerase. The amplification cycles are as follows: 1 cycle for 3 min at 94° C., followed by 25-30 cycles of in each case 30 sec at 94° C., 30 sec at 55° C. and 5-6 min at 72° C., followed by 1 cycle of 7-10 min at 72° C., then 4° C.∞. The amplification products are separated electrophoretically and assessed with regard to quantity and quality. If a fragment in the correct size is detected, the assessment is positive.

The following primer sequences are selected:

For the YKL174C gene:

1) forward primer (SEQ ID NO: 1) 5′-GGAATTCCAGCTGACCACCATGCCAGAGTATACGCTACTGGC 2) reverse primer (SEQ ID NO: 2) 5′-ATCCCCGGGAATTGCCATGTCATATATCATATCTACGATCATGG For the YKR009C gene:

1) forward primer (SEQ ID NO: 3) 5′-GGAATTCCAGCTGACCACCATGCCTGGAAATTTATCCTTCAAAG 2) reverse primer (SEQ ID NO: 4) 5′-ATCCCCGGGAATTGCCATGTTATAGTTTAGATTTTGCCTGCGATA 1.2. Modification of the Amplificate

Treatment with exonuclease III is effected in the same 96-well thermoplate as the amplification reaction above, by adding 10 U exonuclease III (MBI-Fermentas). The reactions are incubated for 10 minutes at 20° C. in a cycler and stopped by addition of high-salt buffer from the Qiaquick Purification Kit (Qiagen). The purification is performed as specified in the Qiaquick standard protocol (Qiagen).

1.3. Vector Preparation

The restriction with NcoI and its inhibition is carried out using 30 μg of vector DNA following the NcoI protocol (MBI-Fermentas). A binary vector which, between the T-DNA border sequences, comprises a selection cassette (promoter, selection marker, terminator) and an expression cassette with promoter, cloning cassette and terminator sequence is used. The binary vector has no further NcoI cleavage sites except for the one in the cloning cassette. The cloning cassette consists of the following sequence:

5′-GGAATTCCAGCTGACCACCATGGCAATTCCCGGGGATC-3′ (SEQ ID NO: 5). The nuclease reaction is treated with 1 U T4-DNA polymerase and then incubated for 2 min at 37° C. and stopped by addition of high-salt buffer. The linearized and modified vector fragments are purified via NUCLEOBOND nucleic acid purification columns following the standard protocol (Machery-Nagel).

1.4. Ligation-Independent Cloning

Approx. 30 ng of prepared vector (as per 1.3.) and a defined amount of modified amplificate (approx. 80 ng, as per 1.2) are mixed in a 96-well thermoplate.

Annealing is performed in the cycler as follows: 1 cycle for 15 min at 65° C., cooling to 37° C. (0.1° C./1 sec), 1 cycle for 10 min at 37° C., cooling to 4° C. (0.1° C./1 sec), then 4° C. ∞.

1.5. Transformation of Escherichia coli

The transformation is carried out in the same 96-well thermoplate as the ligation-independent cloning above by adding competent E. coli cells (strain DH5α) and incubation in the cycler for 20 min at 1° C., followed by a heat shock for 90 seconds at 42° C and cooling to 4° C. This is followed by the addition of complete medium (SOC) and transfer into a 96-well deepwell plate with incubation for 45 min at 37° C. The entire reaction is subsequently plated onto agar plates supplemented with kanamycin and incubated overnight at 37° C.

1.6. PCR for Verifying the Cloning Efficacy

The amplification is carried out in a 96-well thermoplate following the protocol for Taq DNA polymerase (Gibco-BRL). The composition is as follows: 1× PCR buffer [20 mM Tris-HCL (pH 8.4), 1.5 mM MgCl₂, 50 mM KCl, 0.2 mM dNTP, 5 pmol forward primer, 5 pmol reverse primer, 0.625 U Taq DNA polymerase. A defined number of plasmid constructs is picked from each agar plate, and in each case one aliquot is transferred into one well, charged with the PCR mastermix, of a 96-well thermoplate. The amplification cycles are as follows: 1 cycle for 5 min at 94° C., followed by 35 cycles with in each case 15 sec at 94° C., 15 sec at 66° C. and 5 min at 72° C., followed by 1 cycle for 10 min at 72° C., then 4° C. ∞.

As a rule, four different colonies are analyzed per codogenic plasmid construct. To this end, the colonies in question are taken up with a tip of a pipette and transferred into the solution for the PCR.

Control primers which bind upstream and downstream of the cloning cassettes and thus make possible the amplification of the insert are used.

The amplification products are separated electrophoretically and assessed for quality. If a PCR fragment with the respective size is detected, the assessment is positive. It is always the colony of the first positive PCR fragment from each group of four colonies which is taken into the subsequent plasmid preparation step.

1.7. Plasmid Preparation

An aliquot of positive colonies is transferred into a well, charged with complete medium (LB) and kanamycin, of a deepwell plate and incubated overnight at 37° C.

The plasmid preparation is carried out as specified in the QIAPREP nucleic acid purification column standard protocol (Qiagen).

1.8. Transformation of Agrobacteria

1 ng of the isolated plasmid DNA is transformed into competent Agrobacterium tumefaciens cells strain GV 3101 pMP90 (Koncz and Schell, Mol. Gen. Gent. 204, 383-396, 1986) in a 96-well thermoplate by means of electroporation. Then, complete medium (YEP) is added and the reaction is transferred into a 96-well deepwell plate with incubation for 3 hours at 28° C. Thereafter, the entire reaction is plated onto YEP agar plates with kanamycin and incubated for 48 hours at 28° C.

2. Plant Transformation and Selection

The plasmid constructs present in the agrobacteria generated as specified in 1, are then available for transforming plants.

2.1. Bacterial Culture

With the aid of a pipette tip, one colony is picked from the agar plate and taken up in 3 ml of liquid TB medium which additionally contains kanamycin, rifampicin and gentamycin. The preculture grows for 48 hours at 28° C. and 120 rpm.

400 ml of LB medium which likewise contains kanamycin and gentamycin are used for the main culture. The preculture is transferred into the main culture. The main culture grows for 18 hours at 28° C. aund 120 rpm. Following centrifugation at 4000 rpm, the pellet is resuspended in infiltration medium (MS medium, 10% sucrose).

2.2. Plant Culture

Dishes (green Piki Saat 80 dishes with perforated bottom, 30×20×4.5 cm, Wiesauplast, Kunststofftechnik, Germany) are half-filled with a GS 90 substrate (standard soil, Werkverband E.V., Germany). The dishes are watered overnight with 0.05% PREVICUR solution (PREVICUR N, fungicide against mildew, Aventis CropScience). Arabidopsis thaliana C24 seeds are broadcast over the dish, approximately 1000 seeds per dish. The dishes are covered with a hood and placed into the stratification chamber (8 h, 110 μE, 22° C.; 16 hours in the dark at 6° C.). After 5 days, the dishes are placed into the short-day phytotron (8 h 130 μE, 22° C.; 16 hours in the dark at 20° C.). Here, they remain for approximately 10 days until the first true leaves have formed.

The seedlings are transferred into pots containing the same substrate (Teku pots, 10 cmØ, series LC, manufacturer: Poppelmann GmbH&Co, Germany). Nine plants are pricked out into each pot. The pots are then returned into the short-day phytotron, where the plants continue to grow.

After 10 days, the plants are transferred into the greenhouse cabin (additional illumination, 16 h, 340 μE, 22° C.; 8 h in the dark at 20° C.), where they continue to grow for another 17 days.

2.3. Transformation

6-week-old Arabidopsis plants which are just flowering are dipped for 10 seconds in the above-described agrobacterial suspension. The suspension had previously been treated with 10 μl of SILWETT L77 surfactant (Crompton S.A., Osi Specialties, Switzerland). The respective method is described by Clough and Bent, 1998.

The plants are then laid flat into a humid chamber for 18 hours. Thereafter, the pots are returned to the greenhouse, where the plants continue to grow. The plants remain for another 10 weeks in the greenhouse until the seeds can be harvested.

2.4. Selection

Depending on the resistance marker used for the selection of transformed plants, the harvested seeds are planted in the greenhouse and subjected to spray selection, or else sterilized and then grown on agar plates with the respective selection agent. After approx. 10-14 days, the transformed resistant plants differ markedly from the dead wild-type seedlings and can be pricked out into 6-cm pots. To carry out the selection in the greenhouse, seedlings are stratified for 3 days and then sprayed with an aqueous solution of 4 mg/l Pursuit. This selection is repeated after 3 days and 5 days. After a further 2 days, the transformed, resistant seedlings (plantlets in the four-leaf stage) can be distinguished clearly from the untransformed plantlets. The nontransgenic seedlings are bleached or dead. The transformed resistant plants are pricked out into 6-cm pots, where they are grown and their seeds are obtained. The seeds of the transgenic A. thaliana plants are stored in the freezer (at −20° C.).

In this manner, a population of transgenic plants, or biological material derived therefrom, of the species Arabidopsis thaliana in whose genome in each case one codogenic gene segment from Saccharomyces cerevisiae is integrated is generated.

This population is subjected to functional studies. The phenotypic study under normal conditions and stress conditions is used as an example.

3. Description of the Morphological Analyses.

3.1. Preparation of the Plant Material

The seeds are planted in plastic pots with a diameter of 60 mm. For the soil mixture, a mixture of GS-90 substrate and quartz sand (4:1 v/v) is mixed in the potting machine and filled into the pots. Thereafter, 35 pots are placed together into a dish and treated with PREVICUR. To carry out treatment, 25 ml of PREVICUR are taken up in 10 l of tap water. This quantity was sufficient for treating approximately 200 pots. The pots are placed into the PREVICUR solution and additionally provided with overhead irrigation with tap water without PREVICUR. The seeds are planted on the same day.

For planting, the seeds which are stored in the freezer (at −20 ° C.) are removed from the reaction vessels with the aid of a toothpick and transferred into the pots which contain the soil mixture. In total, approximately 5-12 seeds are distributed in the middle of the pot.

After planting, the dishes together with the pots are covered with a matching plastic hood and placed into the growth cabinet at 16 h light (20° C.) and for 3-4 days in the dark (4° C.). The humidity level is approximately 80-90%. After the stratification, the test plants are grown for 21 days in a 16 h light/8 h dark photoperiod at 20° C. and an atmospheric humidity of 60%. OSRAM tubes type VIALOX high-pressure sodium fluorescence tube (manufactured by Osram, Germany), which generate light with the color SON-T PLUS at an intensity of 220 μE/m2/s were used as the light source.

After the stratification, the 10-day-old seedlings are singled out. The plants which show the best growth in the middle of the pot are considered the target plants. All the remaining plants are carefully removed with the aid of metal tweezers and discarded.

During their growth, the plants are irrigated twice overhead with tap water (directed at the soil).

3.2. Morphological Analysis

The 21-day-old test plants are assessed using a developmental key. All the morphological deviations from the reference plant (non-genetically-modified wild type) are recorded. The developmental key groups all of the morphological deviations of the Arabidopsis plants into six main groups. Pigment mutants, cabbage head mutants, dwarf mutants, foliar mutants, rosette mutants and flowering mutants are distinguished. In the main group in question, between two and five different phenotypes are recorded. Thus, for example, albino/yellow mutants, fusca mutants and dark-green mutants are distinguished in the main group of the pigment mutants. In addition to the main traits, other traits such as shape of the leaves, the degree of hirsuitness of the leaves, plant size and flowering time are recorded in the main group in question.

With the aid of the developmental key, a plurality of lines in which a marked morphological deviation to the phenotype of the wild type is observed are detected within the population.

Thus, early flowering is observed in lines (2510) which express the YKL174C gene (similarity to choline transport protein HNM1P). These transgenic plants already flower at an age of 25 days. In contrast, the non-transgenic wild-type plants flower at an age of as much as 30 days or more under identical growth conditions. In addition to the phenomenon of early flowering, other morphological traits such as dark-green pigmentation and dwarfism can be identified with the aid of the key.

In lines which express the YKR009C gene (hydratase-dehydrogenase-epimerase, peroxisomal), dwarfism is observed on day 21 post-germination. The rosette diameter of the transgenic plants measures approximately 15 mm, while the rosette diameter of a wild type of equal age measures approximately 40 mm. Also, pigmentation is markedly darker, which can be attributed to an increased chlorophyll content.

Phenotypic modifications are also observed in further lines which express specific, other codogenic gene segments, while no phenotypic modifications in the transgenic plants were observed for the expression of the remaining codogenic gene segments in the present model.

3.3. Drought Stress Analysis

After the morphological traits were studied, the test plants were exposed to the drought stress test. To ensure identical drought stress conditions for all of the test plants, the test plants are watered generously before being exposed to the drought stress. This measure is intended to ensure as much homogeneity as possible with regard to the moisture content at the beginning of the test. To minimize light damage, the light intensity is reduced to approximately 150 μEm⁻²s⁻¹ during the test. The relative atmospheric humidity is reduced from 60% to 20% at 20° C. The relative atmospheric humidity is reduced stepwise by 10% per day. To rule out positional effects, the trays holding the test plants are rotated in the chamber over the entire test period.

The plants which display sensitivity to the stressor (which are sensitive in comparison with the wild type) are recorded on days 7, 8 and 9 after the onset of the stress. The old leaves of the drought-sensitive plant show yellowish or pale green discoloration and, having wilted, lie on the ground. The younger leaves, in contrast, are curled and resemble the healthy leaves in color. The size of the drought-sensitive plants, in contrast, corresponds approximately to that of the wild type. On day 9 after the onset of the stress, the drought-sensitive plants are desiccated and brittle.

Plants which show drought resistance in comparison with wild-type reference plants are observed daily between days 11 and 14 after the onset of the stress. The observations are recorded. The phenotype of the drought-resistant plants does not differ from the phenotype of a non-stressed plant. The wild-type plants, in contrast, are desiccated after day 10.

Thus, the expression of specific codogenic gene segments from Saccharomyces cerevisiae (in particular the ORFs YDR51w and YER174c encoding glutaredoxin and thioredoxin) leads to improved drought resistance in Arabidopsis thaliana in comparison with the wild type, while the expression of the remaining codogenic gene segments has no effect on drought resistance.

3.4. Frost Stress Analysis.

To carry out the frost stress analysis, the test plants are raised as described above. At the age of 21 days, they are exposed to the frost stress test. The light intensity during the test is reduced to approximately 50 μEm⁻²s⁻¹ in order to avoid light damage during the stress test. The relative atmospheric humidity is not modified during the test. The frost stress treatment is carried out at the following temperatures: first 10 hours at −2° C. and subsequently 48 hours at −6° C. The transition from 20° C. to −2° C., from −2° C. to −6° C. and from −6° C. to 20° C. takes in each case 4 hours. These stress conditions are sufficient to kill the wild-type plants. 3 days after the frost stress test has ended, the test plants are analyzed for the presence of resistance reactions.

In the frost-resistant lines, it is observed that growth continues for 3 days after the stress has ended. Only occasional frost damage, if any, is observed. In most cases, this is limited to the oldest leaves. Frost damage is characterized by desiccation and discoloration of the damaged parts, in particular the leaves. The discolored leaves are pale green or yellow.

Thus, the expression specific codogenic gene segments from Saccharomyces cerevisiae leads to improved frost resistance in Arabidopsis thaliana in comparison with the wild type, while the expression of the remaining codogenic gene segments has no effect on frost resistance.

4. Plant Cultivation for Bioanalytical Studies.

The bioanalytical study of the plants requires that the plants are grown rapidly and uniformly.

For the soil mixture, a mixture of GS-90 substrate and quartz sand (4:1 v/v) is mixed in the potting machine (Laible System GmbH, Singen, Germany) and filled into the pots. Thereafter, 35 pots are placed together into a dish and treated with PREVICUR. To carry out treatment, 25 ml of PREVICUR are taken up in 10 l of tap water. This quantity was sufficient for treating approximately 200 pots. The pots are placed into the PREVICUR solution and additionally provided with overhead irrigation with tap water without PREVICUR. The seeds are planted on the same day.

For planting, the seeds which are stored in the freezer (at −20° C.) are removed from the Eppendorf tubes with the aid of a toothpick and transferred into the pots which contain the soil mixture. In total, approximately 5-12 seeds are distributed in the middle of the pot.

After planting, the dishes together with the pots are covered with a matching plastic hood and placed into the growth cabinet at 16 h light (20° C.) and 8 hours darkness at 4° C. The humidity is approximately 80-90% and the time of the treatment is 72 hours. After the stratification, the test plants are cultivated for 21 days at a 16-hour-light/8-hour-dark photoperiod at 20° C., an atmospheric humidity of 60% and a CO2 concentration of 400 ppm. OSRAM daylight lamps type POWERSTAR HQI-T 250 W/D, which generate a light resembling the solar spectrum at a light intensity of 220 μE/m2/s are used as the light source.

After the stratification, the 10-day-old seedlings are singled out. The plants which show the best growth in the middle of the pot are considered the target plants. All the remaining plants are carefully removed with the aid of metal tweezers and discarded.

During their growth, the plants are subjected twice daily to overhead irrigation with tap water (directed at the plants/pots). After approximately 22 days, the plants, which weigh approximately 300-400 mg, are harvested for the analysis.

5. Metabolic Analysis

The plant material is harvested rapidly and shock-frozen in liquid nitrogen, freeze-dried and then extracted by means of accelerated solvent extraction (ASE) using methanol/water and methanol/dichloromethane. Following liquid-liquid-partition, aliquots of the organic phase and the aqueous phase are evaporated to dryness. In the case of the organic phase, the derivatization comprises transmethylation followed by methoxyamination by reaction with methoxyamine hydrochloride and trimethylsilylation with MSTFA (N-methyl-N-(trimethylsilyl)-trifluoroacetamide). The methanolic/aqueous phase is derivatized only by means of methoxyamination and trimethylsilylation. Final determination of the analytes is carried out by GC/EI-MS. To carry out an LC-MS/MS, the extracts, which have been evaporated to dryness, are only taken up in mobile phase and then analyzed.

In this way the plant material can be investigated, for example, for the amount therein of the amino acids methionine and threonine. For this purpose the methionine content and threonine content are determined by liquid and gas chromatography (LC and GC) for each transgenic plant in the above-described population (Arabidopsis thaliana with codogenic gene segments from Saccharomyces cerevisiae) in each case on 15 transgenic lines and on further plant samples, serving for comparison, from wild-type plants (Arabidopsis thaliana).

The chromatograms are evaluated by dividing the measured peak areas or peak heights of the analyte under determination (methionine or threonine) by the peak area or peak height of the respective internal standard. These values are standardized to the fresh weight determined for the plant. The value for each transgenic plant is then related to the respective wild-type control group, by dividing it by the mean of the 5 values obtained for the wild-type plants in question (factor 1).

Additionally, the value for a transgenic plant with a defined ORF is related to the mean of the values of all other 14 transgenic plants in whose genome the ORF has not been integrated, and the values of the 5 wild-type controls (factor 2).

The table below lists these values from two experiments, conducted independently of one another, for the plants in whose genome the YEL046C ORF from Saccharomyces cerevisiae has been integrated.

Analyte Factor 1 Factor 2 GC/LC Methionine 3.46-3.58 3.31-3.4 LC Threonine 0.45-0.15 0.61-0.15 LC Threonine 0.17-0.16 0.18-0.16 GC Methionine 3.31-3.67  3.5-3.53 GC

Accordingly, those transgenic plants in whose genome the YEL046C ORF, encoding threonine aldolase, has been integrated, exhibit a significantly increased methionine content and a significantly decreased threonine content. This applies not only to the comparison with wild-type plants (factor 1) but also to the comparison with the other transgenic plants in the population investigated, in whose genome the said ORF has not been integrated (factor 2). 

1. A population of transgenic plants that consists of a plurality of transgenic plants of which each plant has integrated into its genome one or more than one copy of one codogenic gene segment of a cellular donor organism that encodes one protein of the cellular donor organism, but has integrated no other codogenic gene segment encoding another protein of the cellular donor organism, wherein the integrated codogenic gene segment comprises a start codon and a stop codon that define an open reading frame encoding the protein; wherein the integrated codogenic gene segment is operably linked to a regulatory sequence so that the open reading frame is transcribed and translated; wherein said plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of the cellular donor organism; and wherein the plurality of transgenic plants is obtained by a) providing a known codogenic gene segment that encodes one protein of a cellular donor organism and integrating one or more than one copy of the codogenic gene segment into the genome of a plant; b) providing another known codogenic gene segment that encodes another protein of the cellular donor organism and integrating one or more than one copy of the other codogenic gene segment into the genome of another plant; c) repeating step b) so that the resulting plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of the cellular donor organism.
 2. The population of claim 1, wherein the codogenic gene segments are integrated into the nuclear genome.
 3. The population of claim 1, wherein 1 to 5 copies of the codogenic gene segments are integrated per plant cell.
 4. The population of claim 1, wherein the codogenic gene segments are unilaterally or bilaterally flanked by a T-DNA sequence.
 5. The population of claim 1, wherein the regulatory sequence contains a signal peptide-encoding sequence, transit-peptide-encoding sequence, or both.
 6. The population of claim 1, wherein the cellular donor organism is a microorganism.
 7. The population of claim 6, wherein the microorganism is a yeast.
 8. The population of claim 7, wherein the yeast is Saccharomyces cerevisiae.
 9. The population of claim 6, wherein the microorganism is an organism of the species Escherichia coli.
 10. The population of claim 1, wherein the transgenic plants belong to the genus Arabidopsis.
 11. The population of claim 1, wherein the transgenic plants belong to the genus Oryza.
 12. The population of claim 1, wherein the product of the codogenic gene segments has a function in metabolism, energy, transcription, protein synthesis, protein processing, cellular transport and transport mechanisms, cellular communication and signal transduction, cell rescue, cell defense and cell virulence, regulation of the cellular environment and interaction of the cell with its environment, cell fat, transposable elements, viral proteins and plasmid proteins, cellular organization control, subcellular localization, regulation of protein activity, proteins with binding function or cofactor requirement, and/or transport facilitation.
 13. A method of generating the population of transgenic plants of claim 1, comprising: a) providing a known codogenic gene segment that encodes one protein of a cellular donor organism and integrating one or more than one copy of the codogenic gene segment into the genome of at least one plant; b) providing another known codogenic gene segment that encodes another protein of the cellular donor organism and integrating one or more than one copy of the other codogenic gene segment into the genome of another plant; and c) repeating step b) so that the resulting plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of the cellular donor organism; wherein each plant contains one or more than one copy of one codogenic gene segment of a cellular donor organism that encodes one protein of the cellular donor organism, but has integrated no other codogenic gene segment encoding another protein of the cellular donor organism; wherein said plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of the cellular donor organism, and wherein the codogenic gene segment comprises a start codon and a stop codon that define an open reading frame encoding the protein so that the open reading frame is transcribed and translated.
 14. The method of claim 13, wherein bacteria from the genus Agrobacterium are transformed with one or more of the codogenic gene segments.
 15. The method of claim 14, wherein the bacterium is Agrobacterium tumefaciens.
 16. The method of claim 13, wherein the codogenic gene segment is integrated into the genome of said at least one plant by: allowing agrobacteria that are transformed with the codogenic gene segments to act on the plant or suitable biological material therefrom; and obtaining the transgenic plant or suitable biological material therefrom.
 17. The method of claim 13, wherein the codogenic gene segment is integrated into the genome of said at least one plant by: allowing agrobacteria that are transformed with the codogenic gene segments to act in planta.
 18. The method of claim 16, further comprising planting seeds of the plants treated with agrobacteria subjecting the seeds to selective conditions, and obtaining transformed plants or suitable biological material therefrom.
 19. A method of generating the population of transgenic plants of claim 1, comprising: providing at least 50% of all of the codogenic gene segments encoding a protein of a cellular donor organism; and integrating said codogenic gene segments encoding a protein into the genome of at least one plant, wherein one or more copies of a single specific codogenic gene segment encoding a protein but no other codogenic gene segment encoding another protein of the cellular donor organism are integrated into a plant, resulting in a plurality of transgenic plants by: a) providing a known codogenic gene segment that encodes one protein of a cellular donor organism and integrating one or more than one copy of the codogenic gene segment into the genome of at least one plant; b) providing another known codogenic gene segment that encodes another protein of the cellular donor organism and integrating one or more than one copy of the other codogenic gene segment into the genome of another plant; and c) repeating step b) so that the resulting plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of a cellular donor organism; wherein said plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of a cellular donor organism, and wherein the integrated codogenic gene segment comprises a start codon and a stop codon that define an open reading frame encoding the protein and is capable of expressing a product.
 20. A method for identifying stress tolerant transgenic plants comprising exposing the population of transgenic plants of claim 1 to stress conditions and identifying the transgenic plants exhibiting stress tolerance when compared to wild-type plants subjected to the same stress conditions.
 21. The method of claim 20, wherein the stress conditions is low-temperature stress or drought stress.
 22. A population of transgenic plants that consists of a plurality of transgenic plants of which each plant has integrated into its genome one or more than one copy of one codogenic gene segment of a cellular donor organism that encodes one protein of the cellular donor organism, but has integrated no other codogenic gene segment encoding another protein of the cellular donor organism, wherein the integrated codogenic gene segment comprises a start codon and a stop codon that define an open reading frame encoding the protein; wherein the integrated codogenic gene segment is operably linked to a regulatory sequence so that the open reading frame is transcribed and translated; wherein said plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of the cellular donor organism; and wherein the plurality of transgenic plants is obtained by a) providing a known codogenic gene segment that encodes one protein of a cellular donor organism and integrating one or more than one copy of the codogenic gene segment into the genome of a plant; b) providing another known codogenic gene segment that encodes another protein of the cellular donor organism and integrating one or more than one copy of the other codogenic gene segment into the genome of another plant; c) repeating step b) so that the resulting plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of the cellular donor organism; wherein the proteins are proteins that are expressed in the cellular donor organism.
 23. A population of transgenic plants that consists of a plurality of transgenic plants of which each plant has integrated into its genome one or more than one copy of one codogenic gene segment of a cellular donor organism that encodes one protein of the cellular donor organism, but has integrated no other codogenic gene segment encoding another protein of the cellular donor organism, wherein the integrated codogenic gene segment comprises a start codon and a stop codon that define an open reading frame encoding the protein; wherein the integrated codogenic gene segment is operably linked to a regulatory sequence so that the open reading frame is transcribed and translated; wherein said plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of the cellular donor organism; and wherein the plurality of transgenic plants is obtained by a) providing a known codogenic gene segment that encodes one protein of a cellular donor organism and integrating one or more than one copy of the codogenic gene segment into the genome of a plant; b) providing another known codogenic gene segment that encodes another protein of the cellular donor organism and integrating one or more than one copy of the other codogenic gene segment into the genome of another plant; c) repeating step b) so that the resulting plurality of transgenic plants as a whole comprises at least 50% of all codogenic gene segments that encode a protein of the cellular donor organism; wherein the proteins are full-length proteins.
 24. The population of claim 1, wherein the protein sequence of the proteins is available from a public gene database selected from the group consisting of the EMBL database, the GenBank database, the PIR database, the SGD database, the MIPS database, the GenProtEC database, and the TAIR database.
 25. The population of claim 1, wherein the cellular donor organism is selected from the group of microorganisms consisting of Acetobacter (subgen. Acetobacter) aceti, Acidithiobacillus ferrooxidans, Acinetobacter sp., Actinobacillus sp., Aeromonas salmonicida, Agrobacterium tumefaciens, Aquifex aeolicus, Arcanobacterium pyogenes, Aster yellows phytoplasma, Bacillus sp., Bifidobacterium sp., Borrelia burgdorferi, Brevibacterium Brucella melitensis, Buchnera sp., Butyrivibrio fibrisolvens, Campylobacter jejuni, Caulobacter crescentus, Chlamydia sp., Chlamydophila sp., Chlorobium limicola, Citrobacter rodentium, Clostridium sp., Comamonas testosteroni, Corynebacterium sp., Coxiella burnetii, Deinococcus radiodurans, Dichelobacter nodosus, Edwardsiella ictaluri, Enterobacter sp., Erysipelothrix rhusiopathiae, Escherichia coli, Flavobacterium sp., Francisella tularensis, Frankia sp. CpI1, Fusobacterium nucleatum, Geobacillus stearothermophilus, Gluconobacter oxydans, Haemophilus sp., Helicobacter pylori, Klebsiella pneumoniae, Lactobacillus sp., Lactococcus lactis, Listeria sp., Mannheimia haemolytica, Mesorhizobium loti, Methylophaga thalassica, Microcystis aeruginosa, Microscilla sp. PRE1, Moraxella sp. TA144, Mycobacterium sp., Mycoplasma sp., Neisseria sp., Nitrosomonas sp., Nostoc sp. PCC 7120, Novosphingobium aromaticivorans, Oenococcus oeni, Pantoea citrea, Pasteurella multocida, Pediococcus pentosaceus, Phormidium foveolarum, Phytoplasma sp., Plectonema boryanum, Prevotella ruminicola, Propionibacterium sp., Proteus vulgaris, Pseudomonas sp., Ralstonia sp., Rhizobium sp., Rhodococcus equi, Rhodothermus marinus, Rickettsia sp., Riemerella anatipestifer, Ruminococcus flavefaciens, Salmonella sp., Selenomonas ruminantium, Serratia entomophila, Shigella sp., Sinorhizobium meliloti, Staphylococcus sp., Streptococcus sp., Streptomyces sp., Synechococcus sp., Synechocystis sp. PCC 6803, Thermotoga maritima, Treponema sp., Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Xylella fastidiosa, Yersinia sp., and Zymomonas mobilis.
 26. The population of claim 1, where the population comprises at least 3000 different codogenic gene segments from Saccharomyces cerevisiae or at least 3000 different codogenic gene segments from E. coli.
 27. The population of claim 1, wherein the codogenic gene segments include codogenic gene segments selected from the codogenic gene segments shown in FIG. 1 or FIG.
 2. 